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We analyze cosmology assuming unitary quantum mechanics, using a tripartite partition into 
system, observer and environment degrees of freedom. This generalizes the second law of ther- 
modynamics to "The system's entropy can't decrease unless it interacts with the observer, and it 
can't increase unless it interacts with the environment. " We show that because of the long-range 
entanglement created by cosmological inflation, the cosmic entropy decreases exponentially rather 
than linearly with the number of bits of information observed, so that a given observer can re- 
duce entropy by much more than the amount of information her brain can store. Indeed, we argue 
that as long as inflation has occurred in a non-neglible fraction of the volume, almost all sentient 
observers will find themselves in a post-inflationary low-entropy Hubble volume, and we humans 
have no reason to be surprised that we do as well, which solves the so-called inflationary entropy 
problem. An arguably worse problem for unitary cosmology involves gamma-ray-burst constraints 
on the "Big Snap" , a fourth cosmic doomsday scenario alongside the "Big Crunch" , "Big Chill" and 
"Big Rip" , where an increasingly granular nature of expanding space modifies our life-supporting 
laws of physics. Our tripartite framework also clarifies when the popular quantum gravity approx- 
imation Gfj,v ~ 8nG{Tf_i^) is valid, and how problems with recent attempts to explain dark energy 
as gravitational backreaction from super-horizon scale fluctuations can be understood as a failure 
of this approximation. 



I. INTRODUCTION 

The spectacular progress in observational cosmology 
over the past decade has established cosmological infla- 
tion [IHl] as the most popular theory for what happened 
early on. Its popularity stems from the perception that it 
elegantly explains certain observed properties of our uni- 
verse that would otherwise constitute extremely unlikely 
fluke coincidences, such as why it is so flat and uniform, 
and why there are 10~^-level density fluctuations which 
appear adiabatic, Gaussian, and almost scale-invariant 

If a scientific theory predicts a certain outcome with 
probability below 10~^, say, then we say that the the- 
ory is ruled out at 99.9999% confidence if we nonethe- 
less observe this outcome. In this sense, the classic Big 
Bang model without inflation is arguably ruled out at 
extremely high significance. For example, generic initial 
conditions consistent with our existence 13.7 Billion years 
later predict observed cosmic background fluctuations 
that are about 10^ times larger than we actually observe 
[5] — the so-called horizon problem P]. In other words, 
without inflation, the initial conditions would have to be 
highly fine-tuned to match our observations. 

However, the case for inflation is not yet closed, even 
aside from issues to do with measurements [5], compet- 
ing theories P^UHT^ and the so-called measure problem 
[51 . In particular, it has been argued that the so- 

called "entropy problem" invalidates claims that inflation 
is a successful theory. This "entropy problem" was ar- 
ticulated by Penrose even before inflation was invented 
[34j , and has recently been clarified in an important body 
of work by Carroll and collaborators [35l |36| • The basic 
problem is to explain why our early universe had such 



low entropy, with its matter highly uniform rather than 
clumped into huge black holes. The conventional answer 
holds that inflation is an attractor solution, such that 
a broad class if initial conditions lead to essentially the 
same inflationary outcome, thus replacing the embarrass- 
ing need to explain extremely unusual initial conditions 
by the less embarrassing need to explain why our initial 
conditions were in the broad class supporting inflation. 
However, |36| argues that the entropy must have been 
at least as low before inflation as after it ended, so that 
inflation fails to make our state seem less unnatural or 
flne-tuned. This follows from the mapping between initial 
states and flnal states being invertible, corresponding to 
Liouville's theorem in classical mechanics and unitarity 
in quantum mechanics. 



The main goal of this paper is to investigate the en- 
tropy problem in unitary quantum mechanics more thor- 
oughly. We will see that this fundamentally transforms 
the problem, strengthening the case for inflation. Our 
flndings also have implications for quantum gravity re- 
search, by clarifying when the popular approximation 
Gfj.1, ~ 87rG'(T^i/) is and is not valid. The rest of this 
paper is organized as follows. In Section|llj we describe a 
quantitative formalism for computing the quantum state 
and its entropy in unitary cosmology. We apply this for- 
malism to the inflationary entropy problem in Section [TTT| 
and discuss implications in Section [TV] Details regarding 
the "Big Snap" scenario are covered to Appendix \K\ 
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FIG. 1: Because of chaotic dynamics, a single early-universe 
quantum state \'4>) typically evolves into a quantum superpo- 
sition of many macroscopically different states, some of which 
correspond to a large semiclassical post-inflationary universe 
like ours (each with its galaxies etc. in different places), and 
others which do not and completely lack observers. 



II. SUBJECT, OBJECT & ENVIRONMENT 

A. Unitary Cosmology 

The key assumption underlying the entropy problem is 
that quantum mechanics is unitary, so we will make this 
assumption throughout the present paper^ . As described 
in [40] , this suggests the history schematically illustrated 
in Figure [T] a wavefunction describing an early universe 
quantum state (illustrated by the fuzz at the far left) 
will evolve deterministically according to the Schrodinger 
equation into a quantum superposition of not one but 
many macroscopically different states, some of which cor- 
respond to large semiclassical post-inflationary universes 
like ours, and others which do not and completely lack 
observers. The argument of 40J basically went as follows: 

1. By the Heisenberg uncertainty principle, any ini- 
tial state must involve micro-superpositions, micro- 



The forms of non-unitarity historically invoked to address the 
quantum measurement problem tend to make the entropy prob- 
lem worse rather than better: both Copenhagen-style wavefunc- 
tion collapse |37l I38j and proposed dynamical reduction mecha- 
nisms 1391 arguably tend to increase the entropy, transforming 
pure (zero entropy) quantum states into mixed states, akin to a 
form of diffusion process in phase space. 



scopic quantum fluctuations in the various fields. 

2. Because the ensuing time-evolution involves insta- 
bilities (such as the well-known gravitational insta- 
bilities that lead to the formation of cosmic large- 
scale structure) , some of these micro-superpositions 
are amplified into macro-superpositions, much like 
in Schrodinger's cat example [41]. More generally, 
this happens for any chaotic dynamics, where pos- 
itive Lyapunov exponents make the outcome is ex- 
ponentially sensitive to initial conditions. 

3. The current quantum state of the universe is thus 
a superposition of a large number of states that are 
macroscopically different (Earth forms here. Earth 
forms one meter further north, etc), as well as states 
that failed to inflate. 

4. Since macroscopic objects inevitably interact with 
their surroundings, the well-known effects of deco- 
herence will keep observers such as us unaware of 
such macro-superpositions. 

This shows that with unitary quantum mechanics, the 
conventional phrasing of the entropy problem is too 
simplistic, since a single pre-inflationary quantum state 
evolves into a superposition of many different semiclas- 
sical post-inflationary states. The careful and detailed 
analysis of the entropy problem in [36 is mainly per- 
formed within the context of classical physics, and quan- 
tum mechanics is only briefly mentioned, when correctly 
stating that Liouville's theorem holds quantum mechan- 
ically too as long as the evolution is unitarity. However, 
the evolution that is unitary is that of the total quan- 
tum state of the entire universe. We unfortunately have 
no observational information about this total entropy, 
and what we casually refer to as "the" entropy is instead 
the entropy we observe for our particular branch of the 
wavefunction in Figure [TJ We should generally expect 
these two entropies to be quite different — indeed, the 
entropy of the entire universe may well equal zero, since 
if it started in a pure state, unitarity ensures that it is 
still in a pure state. 

B. Deconstructing the universe 

It is therefore interesting to investigate the cosmolog- 
ical entropy problem more thoroughly in the context of 
unitary quantum mechanics, which we will now do. 

Most discussions of quantum statistical mechanics split 
the Universe into two subsystems |42j: the object under 
consideration and everything else (referred to as the en- 
vironment). At a physical level, this "splitting" is simply 
a matter of accounting, grouping the degrees of freedom 
into two sets: those of the object and the rest. At a 
mathematical level, this corresponds to a choice of fac- 
torization of the Hilbert space. 

As discussed in |43j . unitary quantum mechanics can 
be even better understood if we include a third subsystem 
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FIG. 2: An observer can always decompose the world into 
three subsystems: the degrees of freedom corresponding to her 
subjective perceptions (the subject), the degrees of freedom 
being studied (the object), and everything else (the environ- 
ment). As indicated, the subsystem Hamiltonians Hs, Ho, 
He and the interaction Hamiltonians Hso, Hoc, H^c can cause 
qualitatively very different effects, providing a unified picture 
including both decoherence and apparent wave function col- 
lapse. Generally, Hoe increases entropy and -ffso decreases 
entropy. 

as well, the subject, thus decomposing the total system 
(the entire universe) into three subsystems: 

1. The subject consists of the degrees of freedom as- 
sociated with the subjective perceptions of the ob- 
server. This does not include any other degrees of 
freedom associated with the brain or other parts of 
the body. 

2. The object consists of the degrees of freedom that 
the observer is interested in studying, e.g., the 
pointer position on a measurement apparatus. 

3. The environment consists of everything else, i.e., 
all the degrees of freedom that the observer is not 
paying attention to. By definition, these are the 
degrees of freedom that we always perform a partial 
trace over. 

A related framework is presented in [131 03]. Note 
that the first two definitions are very restrictive. Sup- 
pose, for example, that you are measuring a voltage us- 
ing one of those old-fashioned multimeters that has an 
analog pointer. Then the "object" consists merely of the 
single degree of freedom corresponding to the angle of 
the pointer, and excludes all of the other ~ 10^^ degrees 
of freedom associated with the atoms in the multimeter. 



Similarly, the "subject" excludes most of the ~ 10^^ de- 
grees of freedom associated with the elementary particles 
in your brain. The term "perception" is used in a broad 
sense in item 1, including thoughts, emotions and any 
other attributes of the subjectively perceived state of the 
observer. 

This subject-object-environment decomposition of the 
degrees of freedom allows a corresponding decomposition 
of the Hamiltonian: 

H = Hg + Ho + H,, + + ^^so + Hoc + Hsoc, (1) 

where the first three terms operate only within one sub- 
system, the second three terms represent pairwise inter- 
actions between subsystems, and the third term repre- 
sents any irreducible three-way interaction. The practi- 
cal usefulness of this tripartite decomposition lies in that 
one can often neglect everything except the object and 
its internal dynamics (given by Ho) to first order, us- 
ing simple prescriptions to correct for the interactions 
with the subject and the environment, as summarized in 
Table 1. The effects of both Hso and Hoe have been ex- 
tensively studied in the literature. Hso involves quantum 
measurement, and gives rise to the usual interpretation 
of the diagonal elements of the object density matrix as 
probabilities. Hoc produces decoherence, selecting a pre- 
ferred basis and making the object act classically under 
appropriate conditions. Hsc, causes decoherence directly 
in the subject system. For example, [33] showed that any 
qualia or other subjective perceptions that are related to 
neurons firing in a human brain will decohere extremely 
rapidly, typically on a timescale of order 10"^" seconds, 
ensuring that our subjective perceptions will appear clas- 
sical. In other words, it is useful to split the Schrodinger 
equation into pieces: three governing the three parts of 
our universe (subject, object and environment), and ad- 
ditional pieces governing the interactions between these 
parts. Analyzing the effects of these different parts of 
the equation, the Ho part gives most of the effects that 
our textbooks cover, the Hgo part gives Everett's many 
worlds (spreading superpositions from the object to you, 
the subject), the Hoc part gives traditional decoherence, 
the Hse part gives subject decoherence. 

C. Entropy in quantum cosmology 

In the context of unitary cosmology, this tripartite de- 
composition is useful not merely as a framework for clas- 
sifying and unifying different quantum effects, but also 
as a framework for understanding entropy and its evolu- 
tion. In short, Hoc increases entropy while Hgo decreases 
entropy, in the sense defined below. 

To avoid confusion, it is crucial that we never talk of 
the entropy without being clear on which entropy we are 
referring to. With three subsystems, there are many in- 
teresting entropies to discuss, for example that of the 
subject, that of the object, that of the environment and 
that of the whole system, all of which will generally be 
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TABLE I: Summary of three three basic quantum processes discussed in the text 



Interaction 


Dynamics 


Example 


EfTect 


Entropy 


Object-object 

Object-environment 

Object-subject 


p ^ UpU'' 


( 


V 2 2 / 


\2 2 J 

(\ 0^ 

^0 -2) 

OS) 


1 


Unitary evolution 

Decoherence 

Observation 


Unchanged 

Increases 

Decreases 



different from one another. Any given observer can de- 
scribe the state of an object of interest by a density ma- 
trix Pa which is computed from the full density matrix p 
in two steps: 

1. Tracing: Partially trace over all environment de- 
grees of freedom. 

2. Conditioning: Condition on all subject degrees of 
freedom. 

In practice, step 2 often reduces to what textbooks call 
"state preparation" , as explained below. When we say 
"the entropy" without further qualification, we will refer 
to the object entropy S^- the standard von Neumann 
entropy of this object density matrix po, i-e., 



= -trpo logpo- 



(2) 



Below when we speak of the information (in bits) that one 
system (say the environment) has about another (say the 
object), we will refer to the quantum mutual information 
given by the standard definition 



(3) 



1\2 = Si + S2 ~ Si 



where S12 is the joint system, while Si and Si are the en- 
tropies of each subsystem when tracing over the degrees 
of freedom of the other. 

Let us illustrate all this with a simple example in Fig- 
ure[3] where both the subject and object have only a sin- 
gle degree of freedom that can take on only a few distinct 
values (3 for the subject, 2 for the object). For definite- 
ness, we denote the three subject states 1 1'), |^) and |^), 
and interpret them as the observer feeling neutral, happy 
and sad, respectively. We denote the two object states 
It) and ID, and interpret them as the spin component 
("up" or "down") in the z-direction of a spin-1/2 system, 
say a silver atom. The joint system consisting of subject 
and object therefore has only 2x3 = 6 basis states: | L'f), 
\^t), l^t), 1^;). In Figures Figure g we 

have therefore plotted p as a 6 x 6 matrix consisting of 
nine two-by-two blocks. 



1. Effect of Ho: constant entropy 

If the object were to evolve during a time interval t 
without interacting with the subject or the environment 
[Hso — Hoc — Hsoc — 0), then its reduced density matrix 
Po would evolve into UpoU^ with the same entropy, since 
the time-evolution operator U = e~'^°* is unitary. 

Suppose the subject stays in the state jl') and the 
object starts out in the pure state |t)- Let the object 
Hamiltonian Hq correspond to a magnetic field in the y- 
direction causing the spin to precess to the a;-direction, 
i.e., to the state (|t) + |4))/\/2. The object density matrix 
Po then evolves into 



Po = c/|t)(tit/^ = ^(it) + i;))((ti + (;i) 
= ;i(it>(ti + it>(;i + i;)(ti + i;>(;i), 



(4) 



corresponding to the four entries of 1/2 in the second 
matrix of Figure [3j 
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FIG. 3: Time evolution of the 6x6 density matrix for the 
basis states |l't), |ot)i |^t)) the object 

evolves in isolation, then decoheres, then gets observed by the 
subject. The final result is a statistical mixture of the states 
lot) a-nd l^i), simple zero-entropy states like the one we 
started with. 
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This is quite typical of pure quantum evolution: a ba- 
sis state eventually evolves into a superposition of basis 
states, and the quantum nature of this superposition is 
manifested by off-diagonal elements in p^. Another fa- 
miliar example of this is the familiar spreading out of the 
wave packet of a free particle. 

2. Effect of Hoc '■ increasing entropy 

This was the effect of Ho alone. In contrast, i/oc will 
generally cause decoherence and increase the entropy of 
the object. Although decoherence is now well-understood 
[3HH51j , we will briefly review some core results here that 
will be needed for the subsequent section about measure- 
ment. 

Let \oi) and je;) denote basis states of the object and 
the environment, respectively. As discussed in detail in 
[Sni ED, decoherence (due to i?oc) tends to occur on 
timescales much faster than those on which macroscopic 
objects evolve (due to -Ho)j making it a good approxima- 
tion to assume that the unitary dynamics is J7 = e~*^°"* 
on the decoherence timescale and leaves the object state 
unchanged, merely changing the environment state in a 
way that depends on the object state |oi), say from an 
initial state |eo) into some final state |ei): 

[/|eo)|o,) = |e.)|o,). (5) 

This means that an the initial density matrix p — 
|eo)(eo| ® Po of the object-environment system, where 
Po = Z]ij(oi|Po|oj)|oi)(oj|, win evolve as 

p ^ UpU^ ^U\eo){eo\PoU^ 

= ^(o.|p„|o,);7|eo)|o,)(eo|(o,|[/t 

The reduced density matrix for the object is this object- 
environment density matrix partial-traced over the envi- 
ronment, so it evolves as 

Po i-> treP = y^(efc|p|efc) 

k 

= ^{oi\Po\oJ){ej\ek){ek\e^)\o^){oj\ 

ijk 

- J2P.PoPj{eAer}, (7) 

where we used the identity X]fcl^fe)(^fel = ^ in the 
penultimate step and defined the projection operators 
Pi = \oi){oi\ that project onto the i*'^ eigenstate of the 
object. This well-known result implies that if the envi- 
ronment can tell whether the object is in state i or j, i.e.. 



if the environment reacts differently in these two cases by 
ending up in two orthogonal states, {ej\ei} = 0, then the 
corresponding (j, j)-element of the object density matrix 
gets replaced by zero: 

i 

corresponding to the so-called von Neumann reduction 
[53] which was postulated long before the discovery of 
decoherence; we can interpret it as object having been 
measured by something (the environment) that refuses 
to tell us what the outcome was.^ 

This suppression of the off-diagonal elements of the 
object density matrix is illustrated in Figure [Sj In this 
example, we have only two object states |oi) = |t) and 
I02) = \D, two environment states, and an interaction 
such that (ei|e2) = 0, giving 

po^^(it)(ti + i;)(ii. (9) 

This new final state corresponds to the two entries of 1 /2 
in the third matrix of Figure |3] In short, when the envi- 
ronment finds out about the system state, it decoheres. 



3. Effect of Hso : decreasing entropy 

Whereas Hoc typically causes the apparent entropy of 
the object to increase, Hso typically causes it to decrease. 
Figure [3] illustrates the case of an ideal measurement, 
where the subject starts out in the state 1 1' ) and Hgo 
is of such a form that the subject gets perfectly corre- 
lated with the object. In the language of equation ([3|, an 
ideal measurement is a type of communication where the 
mutual information I^o between the subject and object 
systems is increased to its maximum possible value pS]. 
Suppose that the measurement is caused by Hso becom- 
ing large during a time interval so brief that we can ne- 
glect the effects of Hs and Ho- The joint subject-|-object 
density matrix pso then evolves as pso U psoU\ where 
U = exp \~i J Hsodt\. If observing |t) makes the sub- 
ject happy and \\.) makes the subject sad, then we have 
U\:t) = lot) and U\:i) = The state given by 



^ Equation | |34| is known as the Liiders projection I54| for the more 
general case where the Pi are more general projection operators 
that still satisfy PiPj = SijPi, — ^- This form also follows 

from the decoherence formula ^ for the more general case where 
the environment can only tell which group of states the object 
is in (because the eigenvalues of Hoe are degenerate within each 
group), so that {ej\ei) = 1 if i and j belong to the same group 
and vanishes otherwise. One then obtains an equation of the 
same form as equation (jsj, but where each projection operator 
projects onto one of the groups. 
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equation (|9| would therefore evolve into 

Po = \u{\"){-:\)®{W){^\ + \i){m^ 
= \{uy:^){:w^ + u[:i){:wu^ 



(10) 



(l^t)(^tl 



4)(^; I) = 2 (m +p© 



as illustrated in Figure [sj where p@ = |ot)(ot| and 
p@ = |^|)(^| |. This final state contains a mix- 
ture of two subjects, corresponding to definite but oppo- 
site knowledge of the object state. According to both of 
them, the entropy of the object has decreased from one 
bit to zero bits. As mentioned above, there is a sepa- 
rate object density matrix corresponding to each of 
these two observers. Each of these two observers picks 
out her density matrix by conditioning the density ma- 
trix of equation ( 10 ) on her subject degrees of freedom, 
i.e., the density matrix of the happy one is pQ and that 
of the other one is p@ . These are what Everett termed 
the "relative states" [46], except that we are expressing 
them in terms of density matrices rather than wavefunc- 
tions. In other words, a subject by definition has zero 
entropy at all times, subjectively knowing her state per- 
fectly. Related discussion of the conditioning operation 
is given in [131 Si] • 

In many experimental situations, this projection step 
in defining the object density matrix corresponds to the 
familiar textbook process of quantum state preparation. 
For example, suppose an observer wants to perform a 
quantum measurement on a spin 1/2 silver atom in the 
state It). To obtain a silver atom prepared in this state, 
she can simply perform the measurement of one atom, 
introspect, and if she finds that she is in state then 
she know that her atom is prepared in the desired state 
It) — otherwise she discards it and tries again with other 
atom until she succeeds. Now she is ready to perform her 
experiment. 

In cosmology, this state preparation step is often so ob- 
vious that it is easy to overlook. Consider for example the 
state illustrated in Figure [T] and ask yourself what den- 
sity matrix you should use to make predictions for your 
own future cosmological observations. All experiments 
you can ever perform are preceded by you introspecting 
and implicitly confirming that you are not in one of the 
stillborn galaxy-free wavefunction branches that failed to 
inflate. Since those dead branches are thoroughly deco- 
hered from the branch that you are in, they are com- 
pletely irrelevant to predicting your future, and it would 
be a serious mistake not to discard their contribution to 
the density matrix of your universe. This conditionaliza- 
tion is analogous to the use of conditional probabilities 
when making predictions in classical physics. If you are 
playing cards, for example, the probabilistic model that 
you make for your opponents hidden cards reflects your 
knowledge of your own cards; you do not consider shuf- 
fling outcomes where you were dealt different cards than 
those you observe. 



Just as decoherence can be partial, when {ej\ei) ^ 0, 
so can measurement, so let us now derive how observa- 
tion changes the density matrix also in the most general 
case. Let \si) denote the basis states that the subject can 
perceive — as discussed above, these must be robust to 
decoherence, and will for the case of a human observer 
correspond to "pointer states" [SS] of certain degrees of 
freedom of her brain. Just as in the decoherence section 
above, let us consider general interactions that leave the 
object unchanged, i.e., such that the unitary dynamics is 
U = e~*^=°* during the observation and merely changes 
the subject state in a way that depends on the object 
state \oi), say from an initial state |so) into some flnal 
state |(Ti): 



U\so)\Oi) = |f7,;)|Oi). 



(11) 



This means that an initial density matrix p 
|'So)(so| ^ Po of the subject-object system, where po 
J2ij{oi\Po\oj)\oi){oj\, wiU evolve as 

p ^ UpU^ =U\so){so\poU^ 



X](odPo|Oj)|CT»)|Oz)(crj|(Oj 



(12) 



Since the subject will decohere rapidly, on a timescale 
much shorter than that on which subjective perceptions 
change, we can apply the decoherence formula ([s]) to this 
expression with Pi — \si){si\, which gives 

p ^ ^PkpPk ^^\sk}{sk\p\sk}{sk\ 

k k 

= ^{Oi\Po\Oj){Sk\(Ji){(Jj\Sk)\Sk){Sk\ «) \Oi){Oj\ 
ijk 

- Y.\sk){sk\^pi''\ (13) 

k 

where 

pi''^ = ^{oi\po\o.j){sk\ai){aj\sk)\o,){oj\ 

= Y,P^,PoP,{sk\aC){sk\a,Y (14) 

is the (unnormalized) density matrix that the subject 
perceiving \sk) will experience. Equation (13) thus de- 



scribes a sum of decohered components, each of which 
contains the subject in a pure state \sk)- For the version 
of the subject perceiving \sk), the correct object density 
matrix to use for all its future predictions is therefore 
pi*'' appropriate re-normalized to have unit trace: 
{k 

Po 



Po 



Y.ijPiPoP3{sk\a,){skW,Y 



trHfepn^. 



E,trp„P,|(sfe|a.)|2 



(15) 



7 



where 



Ilk 



Sk\(yi)Pi- 



(16) 



This can be thought of as a generahzation of Everett's 
so-called relative state from wave functions to density 
matrices and from complete to partial measurements. 

(k) 

We recognize the denominator tr po = 
J2i{'^i\Po\oi)\{sk\o'i)\'^ as the standard expression 
for the probability that the subject will perceive \sk)- 
Note that the same final result in equation ( 15 ) can 
also be computed directly from equation ( |12[ ) without 
invoking decoherence, as po (sfe|p|sfc)/tr (sfe|p|sfc), so 
the role of decoherence lies merely in clarifying why this 
is the correct way to compute the new p^- 

To better understand equation (15), let us consider 
some simple examples: 



1. If 



Si (To- 



then we have a perfect measure- 



ment in the sense that the subject learns the exact 



object state, and equation ( 15 ) reduces to Po^ Pk, 
i.e.,. the observer perceiving \sk) knows that the 
object is in its fc*^ eigenstate. 

2. If \ai) is independent of i, then no information 
whatsoever has been transmitted to the subject, 
and equation (15) reduces to po ^ Po, *-e., nothing 
changes. 

3. If for some subject state k we have {si\aj) = 1 for 
some group of j-values, vanishing otherwise, then 
the observer knows only that the object state is in 
this group (this can ha ppe n if H^^ has degenerate 



eigenvalues). Equation ( |l5| then reduces to j^^^^, 
where 11^ is the projection operator onto this group 
of states. 



4- Entropy and information 

In summary, we see that the object decreases its en- 
tropy when it exchanges information with the subject and 
increases it when it exchanges information with the envi- 
ronment. Since the standard phrasing of the second law 
of thermodynamics is focused on the case where interac- 
tions with the observer are unimportant, we can rephrase 
it in a more nuanced way that explicitly acknowledges 
this caveat: 



Second law of thermodynamics: 

The object's entropy can't decrease unless it 
interacts with the subject. 



We can also formulate an analogous law that focuses 
on the observation process and ignores decoherence: 



Another law of thermodynamics: 

The object's entropy can't increase unless it 
interacts with the environment. 




FIG. 4: Our toy model involves a pixelized space where pixels 
are habitable (green/light grey) or uninhabitable (red/dark 
grey) at random with probability 50%, except inside large 
contiguous inflationary patches where all pixels are habitable. 



For a less cosmological example, consider Helium gas in 
a thermally insulated box, starting off with the gas parti- 
cles in a zero-entropy coherent state, where each atom is 
in a rather well-defined position. There are positive Lya- 
punov exponents in this system because the momentum 
transfer in atomic collisions is sensitive to the impact pa- 
rameter, so before long, chaotic dynamics has placed ev- 
ery gas particle in a superposition of being everywhere in 
a box — indeed, in a superposition of being all over phase 
space, with a Maxwell-Boltzmann distribution. If we de- 
fine the object to be some small subset of the Helium 
atoms and call the rest of the atoms the environment, 
then the object entropy So will be high (corresponding 
to a roughly thermal density matrix p^ oc e~^/^'^) even 
though the the total entropy Soe remains zero; the differ- 
ence between these two entropies reflects the information 
that the environment has about the object via quantum 
entanglement as per equation ([3|. In classical thermo- 
dynamics, the only way to reduce the entropy of a gas 
is to invoke Maxwell's demon. Our formalism provides a 
different way to understand this: the entropy decreases if 
you yourself are the demon, obtaining information about 
the individual atoms that constitute the object. 
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III. APPLICATION TO THE INFLATIONARY 
ENTROPY PROBLEM 



A. A classical toy model 

To build intuition for the effect of observation on en- 
tropy in inflationary cosmology, let us consider the simple 
toy model illustrated in Figure [4] This model is purely 
classical, but we will show below how the basic conclu- 
sions generalize to the quantum case as well. We will also 
see that the qualitative conclusions remain valid when 
this unphysical toy model is replaced by realistic infla- 
tion scenarios. 

Let us imagine an infinite space pixelized into dis- 
crete voxels of finite size, each of which can be in only 
two states. We will refer to these two states as habit- 
able and uninhabitable, and in Figure |4] they are colored 
green/light grey and red/dark grey, respectively. We as- 
sume that some inflation-like process has created large 
habitable patches in this space, which fill a fraction / of 
the total volume, and that the rest of space has a com- 
pletely random state where each voxel is habitable with 
50% probability, independently of the other voxels. 

Now consider a randomly selected region (which we 
will refer to as a "universe" by analogy with our Hub- 
ble volume) of this space, lying either completely in- 
side an inflationary patch or completely outside the in- 
flationary patches — almost all regions much smaller 
than the typical inflationary patch will have this prop- 
erty. Let us number the voxels in our region in some or- 
der 1, 2,3, and let us represent each state by a string 
of zeroes and ones denoting habitable and uninhabit- 
able, where a in the i*^ position means that the i^^ 
voxel is habitable. For example, if our region contains 
30 voxels, then "000000000000000000000000000000" de- 
notes the state where the whole region is habitable, 
whereas "101101010001111010001100101001" represents 
a rather typical non-inflationary state. Finally, we label 
each state by an integer i which is simply its bit string 
interpreted as a binary number. 

Letting n denote the number of voxels in our region, 
there are clearly 2" possible states i = 0, 2" — 1 that 
it can be in. By our assumptions, the probability pi that 
our region is in the i^^ state (denoted Ai) is 



P^ = P{A,) 



/ + (l-/)2- 
(l-/)2-" 



if i = 0, 
if i > 0, 



(17) 



i.e., there is a probability / of being in the i = Q state 
because inflation happened in our region, plus a small 
probability 2~" of being in any state in case inflation did 
not happen here. 

Now suppose that we decide to measure b bits of infor- 
mation by observing the state of the first b voxels. The 
probability P{H) that they are all habitable is simply 



the total probability of the first 2" ^ states, i.e. 



P{H) = J2 P. = / + (l-/)2-" + (2"-^-l)(l-/)2- 



i=0 



/ + (l-/)2 



-6 



(18) 



independent of the number of voxels n in our region. This 
result is easy to interpret: either we are in an inflationary 
region (with probability /), in which case these b voxels 
are all habitable, or we are not (with probability 1 — /), 
in which case they are all habitable with probability 2^^. 

If we find that these b voxels are indeed all habitable, 
then using the standard formula for conditional proba- 
bilities, we obtain the following revised probability dis- 
tribution for the state of our region: 



P{A\H) = 



/+(l-/)2- 
/+(l-/)2- 
(l-/)2 







-/)2 



P{H) 
if i = 0, 

— if i = 1, . 

if i = 2"- 



..,2"-'' - 1, 
^...,2"- 1. 



(19) 



We are now ready to compute the entropy S of our re- 
gion given various degrees of knowledge about it, which 
is defined by the standard Shannon formula 



Z — i 



i=0 



h[p) = -plog^p, (20) 



where we use logarithms of base two so that the entropy 
has units of bits. Consider first the simple case of no 
infiation, f — Q. Then all non-vanishing probabilities 



reduce to p. 



(b) _ Ob- 



and the entropy is simply 
S'('') =n-b. 



(21) 



In other words, the state initially requires n bits to de- 
scribe, one per voxel, and whenever we observe one more 
voxel, the entropy drops by one bit: the one bit of infor- 
mation we gain tells us merely about the state of the ob- 
served voxel, and tells us nothing about the rest of space 
since the other voxels are statistically independent. 

More generally, substituting equation ( 19 1 into equa- 
tion (20) gives 



S^") = h 



/ + (l-/)2- 



/ + (l-/)2-^ 



(l-/)2- 



^(^""-^)n/+(i-/)2- 

(22) 

As long as the number of voxels is large (n 3> b) and 
the inflated fraction / is non-negligible (/ ^ 2^"), this 
entropy is accurately approximated by 



sib) 



f 



yn—b 



/ + (1 - 1)2-" 
, /!(/)+ 2-^/1(1-/) 



h 



(23) 



2"/ 
1-/ 



/ + (l-/)2 



-6 



(1 - /)2-" 

/ + (1 - 

log [/ + (1 - f)2-'] 
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20 30 40 

Number of voxels observed 



50 



FIG. 5: How observations change the entropy for an inflation- 
ary fraction / = 0.5. If successive voxels are aU observed to be 
habitable, the entropy drops roughly exponentially in accor- 
dance with equation (251 (green/grey dots). If the first voxel 
is observed to be uninhabitable, thus establishing that we are 
in a non-inflationary region, then the entropy instead shoots 
up to the line of slope —1 given by equation ( |21[ ) (grey/red 
squares). More generally, we observe b habitable voxels and 
then one uninhabitable one, the entropy first follows the dots, 
then jumps up to the squares, then follows the squares down- 
ward regardless of what is observed thereafter. This figure 
illustrates the case with n = 50 voxels — although n ~ 10^^" 
is more relevant to our actual universe, the the drop toward 
zero of green curve would be too fast to be visible in the such 
a plot. 



The sum of the last two terms is merely an n-independent 
constant of order unity which approaches zero as we ob- 
serve more voxels (as b increases), so in this hmit, equa- 
tion ( 23 ) reduces to simply 



Si") 



2b 



(24) 



For the special case / = 1/2 where half the volume is 
inflationary, equation (23) reduces to the more accurate 
result 



Sib) 



2b + 1 



log[l 



(25) 



without approximations. 

Comparing equation ( [2T| ) with either of the last two 
equations, we notice quite a remarkable difference, which 
is illustrated in Figure [5] in the inflationary case, the 
entropy decreases not linearly (by one bit for every bit 
observed) , but exponentially! This means that in our toy 



inflationary universe model, if an observer looks around 
and finds that even a tiny nearby volume is habitable, 
this dramatically reduces the entropy of her universe. For 
example, if / = 0.5 and there are VS^^'^ voxels, then the 
initial entropy is about lO^^'' bits, and observing merely 
400 voxels (less than a fraction 10^^^^ of the volume) to 
be habitable brings this huge entropy down to less than 
one bit. 

How can observing a single voxel have such a large 
effect on the entropy? The answer clearly involves the 
long-range correlations induced by inflation, whereby this 
single voxel carries information about whether inflation 
occurred or not in all the other voxels in our universe. If 
we observe h ^ — log / habitable voxels, it is exponen- 
tially unlikely that we are not in an inflationary region. 
We therefore know with virtual certainty that the vox- 
els that we will observe in the future are also habitable. 
Since our uncertainty about the state of these voxels has 
largely gone away, the entropy must have decreased dra- 
matically, as equation (24) confirms. 

To gain more intuition for how this works, consider 
what happens if we instead observe the first b voxels to 
be uninhabitable. Then equation ( 19 1 instead makes all 
non- vanishing probabilities pi = 2""", and we recover 
equation (21 1 even when f ^ 0. Thus observing merely 



the first voxel to be uninhabitable causes the entropy to 
dramatically increase, from (1 — f)n to n — 1, roughly 
doubling if / = 0.5. We can understand all this by re- 
calling Shannon's famous result that the entropy gives 
the average number of bits required to specify an out- 
come. If we know that our universe is not inflationary, 
then we need a full n bits of information to specify the 
state of the n voxels, since they are all independent. If we 
know that our universe is inflationary, on the other hand, 
then we know that all voxels are habitable, and we need 
no further information. Since a a fraction (1 — /) of the 
universes are non-inflationary, we thus need (1 — f)n bits 
on average. Finally, to specify whether it is inflationary 
or not, we need 1 bit of information if / = 1/2 and more 
generally the slightly smaller amount h{f) + h{l — /), 
which is the entropy of a two-outcome distribution with 
probabilities / and 1 — f. The average number of bits 
needed to specify a universe is therefore 



5(")«(l-/)n + /!(/) + Ml-/), 



(26) 



which indeed agrees with equation ( |23[ ) when setting b = 
0. 

In other words, the entropy of our universe before we 
have made any observations is the average of a very large 
number and a very small number, corresponding to in- 
flationary and non-inflationary regions. As soon as we 
start observing, this entropy starts leaping towards one 
of these two numbers, reflecting our increased knowledge 
of which of the two types of region we inhabit. 

Finally, we note that the success in this inflationary 
explanation of low entropy does not require an extreme 
anthropic selection effect where life is a priori highly un- 
likely; contrariwise, the probability that our entire uni- 
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verse is habitable is simply /. and the effect works fine 
also when / is of order unity. 



B. The quantum case 

To build further intuition for the effect of observation 
on entropy, let us generalize our toy model to include 
quantum mechanics. We thus upgrade each voxel is a 2- 
state quantum system, with two orthogonal basis states 
denoted |0) ("habitable") and |1) ("uninhabitable"). The 
Hilbert space describing the quantum state of an n- voxel 
region thus has 2" dimensions. We label our 2" basis 
states by the same bit strings as earlier, so the state of 
the 30- voxel example given in Section |III A| above would 
be written 



1101101010001111010001100101001), (27) 



corresponding to basis state i = 759669545. If the region 
is inflationary, all its voxels are habitable, so its density 
matrix is 



Pycs = |000...0)(000...0| 



(28) 



If it is not inflationary, then we take each voxel to be in 
the mixed state 



p* 



|1>(1| 



(29) 



independently of all the other voxels, and the density 
matrix p^o of the whole region is simply a tensor product 
of n such single-voxel density matrices. In the general 
case that we wish to consider, there is a probability / 
that the region is inflationary, so the full density matrix 
is 

P = fPycs + (1 - f)Pno (30) 

= /|000 ... 0) (000 . . . 0| + (1 - /)p* (g) ® (g) . . . 

Expanding the tensor products, it is easy to show that we 
get 2" different terms, and that this full density matrix 
can be rewritten in the form 



(31) 



where are the probabilities given by equation (17 1 



Now suppose that we, just as in the previous section, 
decide to measure b bits of information by observing the 
state of the first b voxels and find them all to be habit- 
able. To compute the resulting density matrix p^'^\ we 
thus condition on our observational results using equa- 
tion (15) with the projection matrix P — |0...0) (0...0|, 



with b occurrences of inside each of the two brackets, 
obtaining 



PpP 
tr PpP ■ 



(32) 



Substituting equation (31) into this expression and per- 



forming some straightforward algebra gives 

2"-l 



(33) 



i=0 



where pf"^ are the probabilities given by equation ( 19 ) 
We can now compute the quantum entropy S of our re- 
gion, which is defined by the standard von Neuman for- 
mula 



5"'' =tTh 



(fc) 



Hp) = -piog2 p, 



(34) 



where we again use logarithms of base two so that the 
entropy has units of bits. This trace is conveniently eval- 
uated in the |'0i)-basis where equation (33) shows that 
the density matrix p^''^ is diagonal, reducing the entropy 
to the sum 



Z —1 



Pi 



(35) 



Comparing this with equation (20), we see that this re- 



sult is identical to the one we derived for the classical 
case. In other words, all conclusions we drew in the pre- 
vious section generalize to the quantum-mechanical case 
as well. 



C. Real- world issues 

Although we repeatedly used words like "inflation" and 
"inflationary" above, our toy models of course contained 
no inflationary physics whatsoever. For example, real 
eternal inflation tends to produce a messy spacetime with 
significant curvature on scales far beyond the cosmic hori- 
zon, not simply large uniform patches embedded in Eu- 
clidean space'^, and real infiation has quantum field de- 
grees of freedom that are continuous rather than simple 
qubits. However, it is also clear that our central result re- 
garding exponential entropy reduction has a very simple 
origin that is independent of such physical details: long- 
range entanglement. In other words, the key was simply 
that the state of a small region could sometimes reveal 
the state of a much larger region around it (in our case, 
local smoothness implied large-scale smoothness). This 
allowed a handful of measurements in that small region 
to, with a non-ncgiigible probability, provide a massive 
entropy reduction by revealing that the larger region was 



^ It is challenging to quantify the inflationary volume fraction / in 
such a messy spacetime, but as we saw above, this does not affect 
the qualitative conclusions as long as / is not exponentially small 
— which appears unlikely given the tendency of eternal inflation 
to dominate the total volume produced. 
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in a very simple state. We saw that the result was so ro- 
bust that it did not even matter whether this long-range 
entanglement was classical or quantum-mechanical. 

It is not merely inflation that produces such long-range 
entanglement, but any process that spreads rapidly out- 
ward from scattered starting points. To illustrate this 
robustness to physics details, consider the alternative ex- 
ample where Figure |4] is a picture of bacterial colonies 
growing in a Petri dish: the contiguous spread of colonies 
creates long-range entanglement, so that observing a 
small patch to be colonized makes it overwhelmingly 
likely that a much larger region around it is colonized. 
Similarly, if you discover that a drop of milk tastes sour, it 
is extremely likely that a much larger volume (your entire 
milk carton) is sour. A random bacterium in a milk car- 
ton should thus expect the entire carton be sour just like 
a random cosmologists in a habitable post-inflationary 
patch of space should expect her entire Hubble volume 
to be post-inflationary. 

IV. DISCUSSION 

In the context of unitary cosmology, we have investi- 
gated the time-evolution of the density matrix with which 
an observer describes a quantum system, focusing on the 
processes of decoherence and observation and how they 
change entropy. Let us now discuss some implications of 
our results for inflation and quantum gravity research. 

A. Implications for inflation 

Although inflation has emerged as the most popular 
theory for what happened early on, bolstered by im- 
proved measurements involving the cosmic microwave 
background and other cosmological probes, the case for 
inflation is certainly not closed. Aside from issues to do 
with measurements [H] and competing theories [TUHH], 
there are at least four potentially serious problems with 
its theoretical foundations, which are arguably interre- 
lated: 

1. The entropy problem 

2. The measure problem 

3. The start problem 

4. The degree-of-freedom problem 

Since we described the entropy problem in the introduc- 
tion, let us now briefly discuss the other three. 

1. The measure problem 

Inflation is generically eternal, producing a messy 
spacetime with infinitely many post-inflationary pockets 
separated by regions that infiate forever |5S1 - [55| . These 



pockets together contain an infinite volume and infinitely 
many particles, stars and planets. Moreover, certain ob- 
servable quantities like the density fluctuation amplitude 
that we have observed to be Q ^ 2 x 10~^ in our part of 
spacetime j8j[59j take different values in different places.^ 
Taken together, these two facts create what has become 
known as the inflationary "measure problem" [SJ I13H33) : 
the predictions of inflation for certain observable quanti- 
ties are not definite numbers, merely probability distri- 
butions, and we do not yet know how to compute these 
distributions. 

The failure to predict more than probability distribu- 
tions is of course not a problem per se, as long as we 
know how to compute them (as in quantum mechanics). 
In inflation, however, there is still no consensus around 
any unique and well-motivated framework for computing 
such probability distributions despite a major community 
effort in recent years. The crux of the problem is that 
when we have a messy spacetime with infinitely many 
observers who subjectively feel like you, any procedure 
to compute the fraction of them who will measure say 
one Q-value rather than another will depend on the or- 
der in which you count them, just as the fraction of the 
integers that are even depends on the order in which you 
count them [S]. There are infinitely many such observer 
ordering choices, many of which appear reasonable yet 
give manifestly incorrect predictions [5| HOI HSl 1311 133] j 
and despite promising developments, the measure prob- 
lem remains open. A popular approach is to count only 
the finite number of observers existing before a certain 
time t and then letting t — >■ oo, but this procedure has 
turned out to be extremely sensitive to the choice of time 
variable t in the spacetime manifold, with no obviously 
correct choice O |20l EH EU |33] , The measure problem 
has eclipsed and subsumed the so-called fine tuning prob- 
lem, in the sense that even the rather special inflaton po- 
tential shapes that are required to match observation can 
be found in many parts of the a messy multidimensional 
inflationary potential suggested by the string landscape 
scenario with its 10^"° or more distinct minima |60,- 6"i) ■ 
so the question shifts from asking why our inflaton po- 
tential is the way it is to asking what the probability is 
of flnding yourself in different parts of the landscape. 

In summary, until the measure problem is solved, infla- 
tion strictly speaking cannot make any testable predic- 
tions at all, thus failing to qualify as a scientiflc theory 
in Popper's sense. 



* Q depends on how the inflaton field rolled down its potential, so 
for a 1-dimensional potential with a single minimum, Q is gener- 
ically different in regions where the field rolled from the left and 
from the right. If there potential has more than one dimension, 
there is a continuum of options, and if there are multiple min- 
ima, there is even the possibility that other effective parameters 
(physical "constants") may differ between different minima, as 
in the string theory landscape scenario |60ti64| . 
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2. The start problem 

Whereas the measure problem stems from the end of 
inflation (or rather the lack thereof), a second problem 
stems from the beginning of inflation. As shown by 
Borde, Guth & Vilenkin [5S], inflation must have had 
a beginning, i.e., cannot be eternal to the past (except 
for the loophole described in [SSI [HZ]), so inflation fails to 
provide a complete theory of our origins, and needs to be 
supplemented with a theory of what came before. (The 
same applies to various ekpyrotic and cyclic universe sce- 
narios [55].) 

The question of what preceded inflation is wide open, 
with proposed answers including quantum tunneling 
from nothing [56l [68] , quantum tunneling from a "pre- 
big-bang" string perturbative vacuum [BH] [7D] and quan- 
tum tunneling from some other non-inflationary state. 
Whereas some authors have argued that eternal infla- 
tion makes predictions that are essentially independent 
of how inflation started, others have argued that this is 
not the case [TTMTSj . Moreover, there is no quantitative 
agreement between the probabilities predicted by differ- 
ent scenarios, some of which even differ over the sign of 
a huge exponent. 

The lack of consensus about the start of inflation not 
only undermines claims that inflation provides a final an- 
swer, but also calls into question whether some of the 
claimed successes of inflation really are successes. In the 
context of the above-mentioned entropy problem, some 
have argued that tunneling into the state needed to start 
inflation is just as unlikely as tunneling straight into the 
current state of our universe [351 136] , whereas others have 
argued that inflation still helps by reducing the amount 
of mass that the quantum tunneling event needs to gen- 
erate [73] . 

3. The degree- of -freedom problem 

A third problem facing inflation is to quantum- 
mechanically understand what happens when a region 
of space is expanded indeflnitely. We discuss this issue in 
detail in Appendix [X] below, and provide merely a brief 
summary here. Quantum gravity considerations suggest 
that the number of quantum degrees of freedom in a 
comoving volume V is flnite. If A'^ increases as this vol- 
ume expands, then we need an additional law of physics 
that specifies when and where new degrees of freedom are 
created, and into what quantum states they are born. If 
N does not increase, on the other hand, life as we know 
it may eventually be destroyed in a "Big Snap" when 
the increasingly granular nature of space begins to alter 
our effective laws of particle physics, much like a rubber 
band cannot be stretched indefinitely before the granu- 
lar nature of its atoms cause our continuum description 
of it to break down. Moreover, in the simplest scenarios 
where the number of observers is proportional to post- 
inflationary volume, such Big Snap scenarios are already 



ruled out by dispersion measurements using gamma ray 
bursts. In summary, none of the three logical possibilities 
for the number of quantum degrees of freedom A^ (it is 
infinite, it changes, it stays constant) is problem free. 

4- The case for inflation: the bottom line 

In summary, the case for inflation will continue to lack 
a rigorous foundation until the measure problem, the 
start problem and the degree-of-freedom problem have 
been solved, so until then, we cannot say for sure whether 
inflation solves the entropy problem and adequately ex- 
plains our low observed entropy. However, our results 
have shown that inflation certainly makes things better. 
We have seen that claims to the contrary are based on 
an unjustified neglect of the density matrix conditioning 
requirement (the third dynamical equation in Table 1), 
thus conflating the entropy of the full quantum state with 
the entropy of subsystems. 

Speciflcally, we have showed that by producing a quan- 
tum state with long-range entanglement, inflation creates 
a situation where observations can cause an exponential 
decrease in entropy, so that merely a handful of quan- 
tum measurements can bring the entropy for our observ- 
able universe down into the low range that we in fact 
observe. This means that if we assume that sentient ob- 
servers require at least a small volume (say enough to fit 
a few atoms) of low temperature (^ 10^^ GeV), then al- 
most all sentient observers will find themselves in a post- 
infiationary low-entropy universe, and we humans have 
no reason to be surprised that we do as well. 

B. Implications for quantum gravity 

We saw above that unjustified neglect of the density 
matrix conditioning requirement (the third dynamical 
equation in Table 1) can lead to incorrect conclusions 
about inflation. The bottom line is that we must not 
conflate the total density matrix with the density matrix 
relevant to us. Interestingly, as we will now describe, 
this exact same conflation has led to various incorrect 
claims in the the literature about quantum gravity and 
dark energy, for example that dark energy is simply back- 
reaction from super-horizon quantum fluctuations. 

1. Is G^^ « 87rG{r^,) ? 

Since we lack a complete theory of quantum gravity, we 
need some approximation in the interim for how quan- 
tum systems gravitate, generalizing the Einstein equa- 
tion Gpi/ = STrGT^ti/ of General Relativity. A common 
assumption in the literature is that to a good approxi- 
mation, 

G^, = 8^G(r^,), (36) 
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where G^^ on the left-hand-side is the usual classical Ein- 
stein tensor specifying spacetime curvature, while (T^i/) 
on the right-hand-side denotes the expectation value of 
the quantum field theory operator T^zy, i.e., {T^u) = 
tr[/9T^i,], where p is the density matrix. Indeed, this 
assumption is often (as in some of the examples cited be- 
low) made without explicitly stating it, as if its validity 
were self-evident. 



So is the approximation of equation ( 36 ) valid? It 



clearly works well in many cases, which is why it contin- 
ues to be used. Yet it is equally obvious that it cannot 
be universally valid. Consider the the simple example 
of inflation with a quadratic potential starting out in a 
homogeneous and isotropic quantum state. This state 
will qualitatively evolve as in Figure [l] into a quantum 
superposition of many macroscopically different states, 
some of which correspond to a large semiclassical post- 
inflationary universe like ours (each with its planets etc. 
in different places). Yet since both the initial quantum 
state and the evolution equations have translational and 
rotational invariance, the final quantum state will too, 
which means that (T^^) is homogeneous and isotropic. 
But equation ( 36 1 then implies that G^^ is homogeneous 
and isotropic as well, i.e., that spacetime is exactly de- 
scribed by the Friedmann-Robertson- Walker metric. The 
easiest way to experimentally rule this out is to stand 
on your bathroom scale and note the gravitational force 
pulling you down. In this particular branch of the wave- 
function there is a planet beneath you, pulling you down- 
ward, and it is irrelevant that there are other decohered 
branches of the wavefunction where the planet is instead 
above you, to your left, to your right, etc., giving an av- 
erage force of zero. (T^^) is position-independent for the 
quantum field density matrix corresponding to the total 
state, whereas the relevant density matrix is the one that 
is conditioned on your perceptions thus far, which include 
the observation that there is a planet beneath you. 



The interesting question regarding equation ( 36 ) thus 



becomes more nuanced: when exactly is it a good approx- 
imation? In this spirit, [75] poses two questions: "How 
unreliable are expectation values?" and How much spatial 
variation should one expect? We have seen above that the 
first step toward a correct treatment is to compute the 
density matrix conditioned on our observations (the third 
dynamic process in Table 1) and use this density matrix 
p to describe the quantum state. Having done this, the 



question of whether equation (36) is accurate basically 



boils down to the question of whether the quantum state 
is roughly ergodic, i.e., whether a small-scale spatial aver- 
age of a typical classical realization is well-approximated 
by the quantum ensemble average {T^u) = tr [pT^jy]. This 
ergodicity tends to hold for many important cases, in- 
cluding the inflationary case where the quantum wave- 
functional for the primordial fields in our Horizon vol- 
ume is roughly Gaussian, homogeneous and isotropic [M] . 
Spatial averaging on small scales is relevant because it 
tends to have little effect on the gravitational field on 
larger spatial scales, which depends mainly on the large- 



scale mass distribution, not on the fine details of where 
the mass is located. For a detailed modern treatment 
of small-scale averaging and its interpretation as "inte- 
grating out" UV degrees of freedom, see [75]. Since very 
large scales tend to be observable and very small scales 
tend to be unobservable, a useful rulc-of-thumb in many 
situations is "condition on large scales, trace out small 
scales" . 

In summary, the popular approximation of equa- 
tion (36) is accurate if both of these conditions hold: 



1. The spatially fluctuating stress-energy tensor for a 
generic branch of the wavefunction can be approx- 
imated by its spatial average. 

2. The quantum ensemble average can be approxi- 
mated by a spatial average for a generic branch 
of the wavefunction. 



2. Dark energy from superhorizon quantum fluctuations? 

The discovery that our cosmic expansion is accelerat- 
ing has triggered a flurry of proposed theoretical expla- 
nations, most of which involve some form of substance or 
vacuum density dubbed dark energy. An alternative pro- 
posal that has garnered significant interest is that there 
is no dark energy, and that the accelerated expansion is 
instead due to gravitational back-reaction from inflation- 
ary density perturbations on scales much larger than our 
cosmic horizon [771 [ZH] • This was rapidly refuted by a 
number of groups [79 ^82], and a related claim that su- 
perhorizon perturbations can explain away dark energy 
[55] was rebutted by [M] . 

Although these papers mention quantum mechanics 
perfunctorily at best (which is unfortunate given that the 
origin of inflationary perturbations is a purely quantum- 
mechanical phenomenon), a core issue in these refuted 
models is precisely the one we have emphasized in this 
paper: the importance of using the correct density ma- 
trix, conditioned on our observations, rather than a total 
density matrix that implicitly involves incorrect averag- 
ing — either quantum "ensemble" averaging as in equa- 
tion ( [36| ) or spatial averaging. For example, as explained 
in [84 , a problem with the effective stress-energy tensor 
(Tfiv) of [S3] is that it involves averaging over regions of 
space beyond our cosmological horizon, even though our 
observations are limited to our backward lightcone. 

Such unjustified spatial averaging is the classical 
physics equivalent of unjustified use of the full density 
matrix in quantum mechanics: in both cases, we get cor- 
rect statistical predictions only if we predict the future 
given what we know about the present. Classically, this 
corresponds to using conditional probabilities, and quan- 
tum mechanically this corresponds to conditioning the 
density matrix using the bottom equation of Table 1 — 
neither is optional. In classical physics, you shouldn't ex- 
pect to feel comfortable in boiling water full of ice chunks 
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just because the spatially averaged temperature is luke- 
warm. In quantum mechanics, you shouldn't expect to 
feel good when entering water that's in a superposition 
of very hot and very cold. Similarly, if there is no dark 
energy and the total quantum state p of our spacetime 
corresponds to a superposition of states with different 
amplitudes for superhorizon modes, then we shouldn't 
expect to perceive a single semiclassical spacetime that 
accelerates (as claimed for some models [771 [IS]), but 
rather to perceive one of many semiclassical spacetimes 
from a decohered superposition, all of which decelerate. 

Dark energy researchers have also devoted significant 
interest to so-called phantom dark energy, which has an 
equation of state w < —1 and can lead to a "big rip" a 
finite time from now, when the dark energy density and 
the cosmic expansion rate becomes infinite, ripping apart 
everything we know. The same logical flaw that we high- 
lighted above would apply to all attempts to derive such 
results by exploiting infrared logarithms in the equations 
for density and pressure [5S] if they give w < — 1 on scales 
much larger than our cosmic horizon, or more generally 
to talking about "the equation of state of a superhorizon 
mode" without carefully spelling out and justifying any 
averaging assumptions made. 



C. Unitary thermodynamics and the Copenhagen 
Approximation 

In summary, we have analyzed cosmology assuming 
unitary quantum mechanics, using a tripartite partition 
into system, observer and environment degrees of free- 
dom. We have seen that this generalizes the second law 
of thermodynamics to "The system's entropy can't de- 
crease unless it interacts with the observer, and it can't 
increase unless it interacts with the environment" . Quan- 
titatively, the system ("object") density matrix evolves 
according to one of the three equations listed in Table 1 
depending on whether the main interaction of the system 
is with itself, with the environment or with the observer. 
The key results in this paper follow from the third equa- 
tion of Table 1, which gives the evolution of the quantum 
state under an arbitrary measurement or state prepara- 
tion, and can be thought of as a generalization of the 
POVM (Positive Operator Valued Measure) formalism 

[sniiiT]. 

Informally speaking, the entropy of an object decreases 
while you look at it and increases while you don't [43] . 
The common claim that entropy cannot decrease simply 
corresponds to the approximation of ignoring the subject 
in Figure [2| i.e., ignoring measurement. Decoherence is 
simply a measurement that you don't know the outcome 
of, and measurement is simply entanglement, a transfer 
of information quantum information about the system: 
the decoherence effect on the object density matrix (and 
hence the entropy) is identical regardless of whether this 
measurement is performed by another person, a mouse, 
a computer or a single particle that encodes information 



about the system by bouncing off of it.^ In other words, 
observation and decoherence both share the same first 
step, with another system obtaining information about 
the object — the only difference is whether that system 
is the subject or the environment, i.e., whether the last 
step is conditioning or partial tracing: 

• observation ~ entanglement + conditioning 

• decoherence ~ entanglement + partial tracing 

Our formalism assumes only that quantum-mechanics 
is unitary and applies even to observers — i.e., we as- 
sume that observers are physical systems too, whose con- 
stituent particles obey the same laws of physics as other 
particles. The issue of how to derive Born rule proba- 
bilities in such a unitary world has been extensively dis- 
cussed in the literature [351 SSI 1551 - 15^ — for thorough 
criticism and defense of these derivations, see [531 ISl]j 
and for a subsequent derivation using inflationary cos- 
mology, see [IS]. The key point of the derivations is 
that in unitary cosmology, a given quantum measurement 
tends to have multiple outcomes as illustrated in Fig- 
ure [l] and that a generic rational observer can fruitfully 
act as if some non- unitary random process ( "wavefunc- 
tion collapse" ) realizes only one of these outcomes at the 
moment of measurement, with a probabilities given by 
the Born rule. This means that in the context of unitary 
cosmology, what is traditionally called the Copenhagen 
Interpretation is more aptly termed the Copenhagen Ap- 
proximation: an observer can make the convenient ap- 
proximation of pretending that the other decohered wave 
function branches do not exist and that wavefunction col- 
lapse does exist. In other words, the approximation is 
that apparent randomness is fundamental randomness. 

In summary, if you are one of the many observers in 
Figure[l] you compute the density matrix p with which to 
best predict your future from the full density matrix by 
performing the two complementary operations summa- 
rized in Table 1: conditioning on your knowledge (gener- 
alized "state preparation") and partial tracing over the 
environment.^ 



As described in detail, e.g., |48H52I . decoherence is not simply 
the suppression of off-diagonal density matrix elements in gen- 
eral, but rather the occurrence of this in the particular basis of 
relevance to the observer. This basis is turn determined dynam- 
ically by decoherence of both the object 48 - 52 and the subject 

iiaiiHi. 

® Note that the factorization of the Hilbert space into subject, ob- 
ject and environment subspaces is different for different branches 
of the wavefunction, and that generally no global factorization 
exists. If you designate the spin of a particular silver atom to be 
your object degree of freedom in this branch of the wavefunction, 
then a copy of you in a branch where planet Earth (including you, 
your lab and said silver atom) are a light year further north will 
settle on a different tripartite partition into subject, object and 
environment degrees of freedom. Fortunately, all observers here 
on Earth here in this wavefunction branch agree on essentially 
the same entropy for our observable universe, which is why we 
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D. Outlook 

Using our tripartite decomposition formalism, we 
showed that because of the long-range entanglement cre- 
ated by cosmological inflation, the cosmic entropy de- 
creases exponentially rather than linearly with the num- 
ber of bits of information observed, so that a given ob- 
server can produce much more negentropy than her brain 
can store. Using this result, we argued that as long as 
inflation has occurred in a non-negligible fraction of the 
volume, almost all sentient observers will find themselves 
in a post-inflationary low-entropy Hubble volume, and we 
humans have no reason to be surprised that we do as well, 
which solves the so-called inflationary entropy problem. 
As detailed in Section |Xj an arguably worse problem for 
unitary cosmology involves gamma-ray-burst constraints 
on the "Big Snap" , a fourth cosmic doomsday scenario 
alongside the "Big Crunch" , "Big Chill" and "Big Rip" , 
where an increasingly granular nature of expanding space 
modifies our effective laws of physics, ultimately killing 
us. 

Our tripartite framework also clarifies when the pop- 
ular quantum gravity approximation G^jy ~ 8TrG{T^^) is 
valid, and how problems with recent attempts to explain 
dark energy as gravitational backreaction from super- 
horizon scale fluctuations can be understood as a failure 
of this approximation. In the future, it can hopefully 
shed light also on other thorny issues involving quantum 
mechanics and macroscopic systems. 
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Appendix A: The Degree-of-Freedom Problem and 
the Big Snap 

Let N denote the number of degrees of freedom in a fi- 
nite comoving volume V of space. Does N stay constant 
over time, as our universe expands? There are three log- 
ically possible answers to this questions, none of which 
appears problem free: 

1. Yes 

2. No 

3. N is infinite, so we don't need to give a yes or no 
answer. 



tend to get a bit sloppy and hubristically start talking about 
"the" entropy, as if there was sueh a thing. 



Option 3 has been called into doubt by quantum grav- 
ity considerations. First, the fact that our classical notion 
of space appears to break down below the Planck scale 
Tpi ~ 10~'^^m calls into question whether N can signifi- 
cantly exceed V/r^^, the volume V that we are consider- 
ing, measured in Planck units. Second, some versions of 
the so-called holographic principle [M] suggests that N 
may be smaller still, bounded not by the V/r'^^ but by 

V^^^/r^i, roughly the area of our volume in Planck units. 
Let us therefore explore the other two options: 1 and 2. 
The hypothesis that degrees of freedom are neither cre- 
ated nor destroyed underlies not only quantum mechanics 
(in both its standard form and with non-unitary GRW- 
likc modifications [33] )j but classical mechanics as well. 
Although quantum degrees of freedom can freeze out at 
low temperatures, reducing the "effective" number, this 
does not change the actual number, which is simply the 
dimensionality of the Hilbert space. 



a. Creating degrees of freedom 

The holographic principle in its original form |96j sug- 
gests option 2, changing A^.^ Let us take our comoving 
volume V to be our current horizon volume, also known 
as our "Hubble volume" or our "observable universe" , of 
radius ^ lO^^m, giving a holographic bound of A^ ^ 10^^" 
degrees of freedom. This exact same comoving volume 
was also the horizon volume during infiation, at the spe- 
cific time when the largest-scale fluctuations imaged by 
the WMAP-satellite [8] left the horizon, but then its ra- 
dius was perhaps of order 10~^^m, giving a holographic 
bound of a measly A^ ~ 10^^ degrees of freedom. Since 
this number is ridiculously low by today's standards (I 
have more bits than that even on my hard disk) , new de- 
grees of freedom must have been created in the interim 
as per option 2.® But then we totally lack a predictive 
theory of physics! To remedy this, we would need a the- 
ory predicting both when and where these new degrees 
of freedom are created, and also what quantum states 
they are created with. Such a theory would also need 
to explain how degrees of freedom disappear when space 
contracts, as during black hole formation. Although some 
interesting early work in this direction has been pursued 
(see e.o. |100j V it appears safe to say that no complete 
self-consistent theory of this type has yet been proposed 
that purports to describe all of physical reality. 



More recent versions of the holographic principle have focused on 
the entropy of 3D light-sheets rather than 3D volumes, evading 
the implications below lOTI 1981 . 

An even more extreme example occurs if a Planck-scale region 
with a mere handful of degrees of freedom generates a whole new 
universe with say 10^^*^ degrees of freedom via the Farhi-Guth- 
Guven mechanism I99| . 
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b. The Big Snap 

This leaves option 3, constant N. It too has received 
indirect support from quantum gravity research, in this 
case the AdS/CFT correspondence, which suggests that 
quantum gravity is not merely degree-of-freedom preserv- 
ing but even unitary. This option suffers from a different 
problem which I have emphasized to colleagues for some 
time, and which I will call the Big Snap. 

If N remains constant as our comoving volume expands 
indefinitely, then the number of degrees of freedom per 
unit volume drops toward zero^ as N/V. Since a rubber 
band consists of a finite number of atoms, it will snap 
if you stretch it too much. Similarly, if our space has a 
finite number of degrees of freedom N and is stretched 
indefinitely, something bad is guaranteed to happen even- 
tually. 

As opposed to the rubber band case, we do not know 
precisely what this "Big Snap" will be like or precisely 
when it will occur. However, it is instructive to consider 
the length scale a = {V/NY^'^: if the degrees of freedom 
are in some sense rather uniformly distributed through- 
out space, then a can be thought of as the characteristic 
distance between degrees of freedom, and we might ex- 
pect some form of spatial granularity to manifest itself on 
this scale. As the universe expands, a grows by the same 
factor as to the cosmic scale factor, pushing this gran- 
ularity to larger scales. It is hard to imagine business 
as usual once a > lO^^m so that the number of degrees 
of freedom in our Hubble volume has dropped below 1. 
However, it is likely that our universe will become unin- 
habitable long before that, perhaps when the number of 
degrees of freedom per atom drops below 1 (a > l~^'^m, 
altering atomic physics) or the number of degrees of free- 
dom per proton drops below 1 (a > l^^^m, altering nu- 
clear physics). This Big Snap thus plays a role similar 
to that of the cutoff hypersurface used to tackle the in- 
flationary measure problem, endowing the "end of time" 
proposal of [104j with an actual physical mechanism. 

Fortunately, there are observational bounds on many 
types of spatial granularity from astronomical observa- 
tions. For a simple lattice with spacing a, the linear 
dispersion relation cj(fc) — ek for light gets replaced by 
w(fc) oc sin(afc), giving a group velocity 



as long as a <^ k~^. This means that if two gamma- 
ray photons with energies Ei and E2 are emitted si- 
multaneously a cosmological distance e/H away, where 
~ lO^^s is the Hubble time, they will reach us sep- 
arated in time by an amount 



Av 



H' 



aAE 
he 



(A2) 



if the energy difference AE = \E2 — Ei\ is of the same 
order as Ei. Structure on a time-scale of 10^'*s has been 
reported in the gamma-ray burst GRB 910711 |105j in 
multiple energy bands, which 106, interpret as a lower 
bound At < 0.01 s for AE = 200 keV. Substituting this 



into equation ( A2 ) therefore gives the constraint 



a < flGRB 



he 
AE 



(HAt) 



1/2 



10 



-21 , 



(A3) 



If N really is finite, then we can consider the fate of 
a hypersurface during the early stages of inflation that 
is defined by a = a* for some constant a*. Each region 
along this hypersurface has its own built-in self-destruct 
mechanism, in the sense that it can only support ob- 
servers like us until it has expanded by a factor Oj /a* , 
where a-f is the a-value beyond which life as we know it 
is impossible. However, in the eternal inflation scenario, 
which has been argued to be generic [56l - f58] . different 
regions will expand by different amounts before inflation 
ends, so we should expect the probability to find our- 
selves in a given region ^ 10^^ seconds after the end of 
inflation to be proportional to {a/a■^,)^ as long as a < a|, 
i.e., proportional to the volume of the region and hence 
to the number of solar systems in the region (at least for 
all regions that share our effective laws of physics). This 
predicts that generic observers should have a drawn from 
the probability distribution 



/(«) 



^ if a < at, 
if a > flj. 



(A4) 



The tight observational constraints in equation ( A3 1 are 



thus very surprising: even if we conservatively assume 
flj = 10~^^m, i.e., that a needs to be 10000 times smaller 
than a proton for us to survive, the probability of observ- 
ing a < acRB is merely 



dto 
dk 



cos a/c ~ 1 — 



(ak)' 



1 faEV 



(Al) 



^ Some interesting models evade this conclusion by denying that 
the physically existing volume can ever expand indefinitely while 
remaining completely "real" in some sense. De Sitter Equilib- 
rium cosmology |101l I102| can be given the radical interpreta- 
tion that once objects leave our cosmic de Sitter horizon, they 
no longer have an existence independent of what remains inside 
our horizon, and some holographic cosmology models have re- 
lated interpretations I103| . 



P{a < cgrb) 



f{a)da = 



ogrb 
at 



10" 



(A5) 

thus ruling out this scenario at 99.999999% confidence. 

This argument should obviously be taken with a grain 
of salt; for example, one can imagine alternative disper- 
sion relations which weaken the bound in equation ( A3 1 . 



However, to be acceptable, any future theory predicting 
a finite unchanging number of degrees of freedom TV must 
repeat this calculation using its own formalism and suc- 
cessfully explain why we do not observer greater time 
dispersion in gamma-ray bursts. 
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Another important caveat is that our space is not 
expanding uniformly: indeed, gravitationally bound re- 
gions Uke our Galaxy are not expanding at all. In spe- 
cific models where the degrees of freedom are localized 
on spatial scales smaller than galaxies, one could imag- 
ine galaxy-dwelling observers happily surviving long af- 
ter intergalactic space has undergone a big snap, as long 
as deleterious effects from these faraway regions do not 
propagate into the galaxies. Note, however, that this sce- 
nario saves only the observers, not the underlying theory. 



Indeed, the discrepancy between theory and observation 
merely gets worse: repeating the above volume weighting 
argument now predicts that we are most likely to find 
ourselves alive and well in a galaxy after the big snap 
has taken place throughout most of space, so the lack 
of any strange observed signatures in light from distant 
extragalactic sources (energy-dependent arrival time dif- 
ferences for gamma-ray bursts, say) becomes even harder 
to explain. 
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